Variable Rate Speech Coding Using Discrete Time Waveletextrema

نویسندگان

  • Pramila Srinivasan
  • Leah H. Jamieson
چکیده

We describe a novel procedure for variable rate encoding of a speech signal starting from the discrete time wavelet extrema representation. We describe the bit reduction achievable by thresholding the extrema signal. We also demonstrate that the thresholding procedure provides a \denoising" eeect. We then reduce the bit rate further by using a bit allocation scheme that adopts a model for the extrema representation, based on the evolution of the maxima across scales of the decomposition. The algorithm automatically adjusts itself to speech activity thus lending itself to variable rate environments. Perceptual results are reported in comparison to the GSM 6.1 standard for speech corrupted with road noise. It has been observed that as the SNR falls below 15 db, the wavelet coder performs better.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Context-based adaptive arithmetic coding in time and frequency domain for the lossless compression of audio coding parameters at variable rate

This paper presents a novel lossless compression technique of the context-based adaptive arithmetic coding which can be used to further compress the quantized parameters in audio codec. The key feature of the new technique is the combination of the context model in time domain and frequency domain which is called time-frequency context model. It is used for the lossless compression of audio cod...

متن کامل

Variable bit-rate sinusoidal transform coding using variable order spectral estimation

Sinusoidal transform coding (STC) is known to be capable of producing good communication quality speech coded at bitrates below 4kb/s. Discrete all-pole modelling (DAP) is an alternative spectral estimation method which can be more accurate than the conventional linear prediction (LP) analysis normally used by STC. In the quest to achieve the highest possible speech quality at lower and lower a...

متن کامل

Cycle extraction for perfect reconstruction and rate scalability

A cycle extractor is presented to be used in a speech coder independently from the coding stage. It samples cycle waveforms (CyWs) of the original prediction residual signal at their natural nonuniform rate. It is shown that perfect reconstruction is possible due to the interplay of these properties for two cycle length normalization and denormalization techniques. The coding stage is coupled t...

متن کامل

An efficient and scalable 2D DCT-based feature coding scheme for remote speech recognition

A 2D DCT-based approach to compressing acoustic features for remote speech recognition applications is presented. The coding scheme involves computing a 2D DCT on blocks of feature vectors followed by uniform scalar quantization, run-length and Huffman coding. Digit recognition experiments were conducted in which training was done with unquantized cepstral features from clean speech and testing...

متن کامل

Presentation of K Nearest Neighbor Gaussian Interpolation and comparing it with Fuzzy Interpolation in Speech Recognition

Hidden Markov Model is a popular statisical method that is used in continious and discrete speech recognition. The probability density function of observation vectors in each state is estimated with discrete density or continious density modeling. The performance (in correct word recognition rate) of continious density is higher than discrete density HMM, but its computation complexity is very ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995